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Abstract — A capacity-achieving scheme based on polar codes is 
proposed for reliable communication over multi-channels which 
can be directly applied to bit-interleaved coded modulation 
schemes. We start by reviewing the ground-breaking work of 
polar codes and then discuss our proposed scheme. Instead of 
encoding separately across the individual underlying channels, 
which requires multiple encoders and decoders, we take advan- 
tage of the recursive structure of polar codes to construct a 
unified scheme with a single encoder and decoder that can be 
used over the multi-channels. We prove that the scheme achieves 
the capacity over this multi-channel. Numerical analysis and 
simulation results for BICM channels at finite block lengths 
shows a considerable improvement in the probability of error 
comparing to a conventional separated scheme. 

I. Introduction 

Polar codes, introduced by Arikan in [1], are the first 
provably capacity achieving codes for the class of binary-input 
symmetric discrete memoryless channels with low encoding 
and decoding complexity. Construction of polar codes is based 
on a phenomenon called the channel polarization. It is proved 
in [1] that as the block length grows large the channels seen 
by individual bits through a certain transformation called the 
polar transformation start polarizing: they approach either a 
noise-less channel or a pure-noise channel. This suggests the 
construction of polar codes as follows: put the information bits 
over the set of good bit-channels i.e. almost noise-less chan- 
nels while fix the input to the rest of the bit-channels to zeros. 
The set of underlying bit-channels can be sorted from good to 
bad based on their corresponding Bhattacharyya parameter. 
Then the bit-channels with Bhattacharrya parameters below a 
certain threshold are called good while the rest are called bad. 
In fact, the unequal error protection seems to be an inherent 
property of polar codes. Therefore, it is natural to exploit this 
property to design codes for bit-interleaved coded modulation 
channels. 

Bit-interleaved coded modulation (BICM) schemes can be 
modeled as a multi-channel consisting of several underlying 
binary-input channels over which the coded bits are transmit- 
ted. Therefore, it is of great interest to design codes that are 
efficient when used over a certain set of different channels. 
However, it is not straightforward how to design a polar 
code to be transmitted over a set of channels rather than a 
single channel, how to establish polarization theory in this 
case etc. One immediate solution is to encode the information 
separately over the underlying channels using polar encoders 
corresponding to each channel. However, from a practical 



point of view, it is desirable to have only one encoder and 
one decoder to reduce the hardware complexity. Also, by 
combining all the channels together and sending one single 
codeword, efficiently designed for the corresponding multi- 
channel, we achieve a better trade-off between the rate and 
probability of error in the whole scheme. We call this unified 
polar-based scheme for transmission over multi-channels as 
compound polar codes. 

The rest of this paper is organized as follows. In Sectionjll] 
we provide some required background on channel polarization 
and construction of polar codes. In Section|IlIj we propose 
compound polar codes for 2-multi-channels and prove that 
it achieves the capacity of the combined channel. We also 
discuss how to extend this to /-multi-channels. In SectionlTV] 
we present the simulation results for AWGN channel with 
BICM 16-QAM constellation. We close the paper by men- 
tioning some directions for future work and open problems in 
Section|V] 

II. Preliminaries 

In this section we provide an overview of the groundbreak- 
ing work of Arikan [1] and others [2]-[4] on polar codes and 
channel polarization. 

The polar code construction is based on the following 
observation by Arikan which is called channel polarization. 
Let 
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The i-th Kronecker power of G, which is denoted by G®'\ is 
defined by induction i.e. G®^ = G and for any i > 1: 



Next, for all N = 2", let us define the Arikan transform matrix 
Gat RnG®''\ where is the bit-reversal permutation 
matrix defined in [1, Section VII-B]. Now consider a block 
of N uniform i.i.d. information bits Ui, U2, ■ . ■ , Un, denoted 
by , and multiply it by Gat to get the vector Xf. X^'s 
are transmitted through N independent copies of a binary 
input discrete memoryless channel (B-DMC) W. The output 
is denoted by Y-^ . The transformation from to Y-^ is 
called the polar transformation. 

A finite-input and finite-output discrete memoryless channel 
is denoted by a triple W}, where Jl^, 'W are finite sets. 

W is the transition probability matrix which is an |^| x 



matrix. For any x e ^ and y e ^V, W[x, y], conventionally 
written as W{y\x), is the probability of receiving y e?^ given 
that X e ^ was sent. With a slight abuse of notation, we 
simply write W to denote the channel (^S(^ ,'3/ , 

Definition 1: A binary-input discrete memoryless channel 
(B-DMC) W : {0,1} ^ is called symmetric if there exists 
a permutation -k : 'W ^ 'W with tt^^ = tt such that for any 
ye^r, W^(y|0) = T^(^(y)|l). 

For any B-DMC W , the Bhattacharyya parameter of W is 

z{w) 2 ^/w{y\mm 

It is easy to show that the Bhattacharyya parameter Z{W) is 
always between and 1. Intuitively, Z{W) shows how good 
the channel W is. Channels with Z{W) close to zero are 
almost noiseless, while channels with Z{W) close to one are 
almost pure-noise channels. This intuition is clarified more by 
the following inequality. It is shown in [1] that for any B-DMC 
W, 

1 - i(w) z{w) ^/i-i(w)^ (2) 

where I{W) is the symmetric capacity of W. 

Let denote the channels that results from N indepen- 
dent copies of W i.e. the channel ({0, 1}^, W^} given 

by N 

W^{y^\x^) Yl^imlx,) 

i = l 

where xf = {xi,X2, ■ ■ ■ ,xn) and yf" ^(2/1 , 2/2 , 
Then the combined channel ^{0, 1}^, iV^, is defined with 
transition probabilities given by 

(4) 

This is the channel that the random vector {Ui,U2, ■ ■ ■ , Un) 
observes through the polar transformation defined 
earlier Arikan [1] also defines the i-th bit-channel 
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<{0,l},^^^x{0,l} 
follows. Let u\ 
length i. For i - 
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for i = 1,2,..., TV, as 
(ui, M2, . . . ,Ui) denote a binary vector of 
= 0, this is the empty string. Then 
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It can be shown that 



(5) 

jY [y'{ ,ui is indeed the 

probability of the event that (Fi, I2, • • ■ , i^Tv) = Vi and 
, U2, • • ■ , Ui-i) = given the event Ut = Ui, provided 
is a priori uniform over {0, 1}^. Intuitively, this is 
the channel that bit Ui observes under Arikan's successive 
cancellation decoding, described later 

The N bit-channels are partitioned into good channels and 
bad channels as follows [2], [3]. Let [A^] =^ {1, 2, . . . , TV} 
and let /3 < Vj be a fixed positive constant. Then the index sets 
of the good and bad channels are given by 
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Theorem 1: [1], [2] For any binary symmetric memoryless 
(BSM) channel W and any constant (3 < Vj we have 

lim ^ CiW) 

N->ao N 

Theorem[T| readily leads to a construction of capacity- 
achieving polar codes. The idea is to transmit the information 
bits over the good bit-channels while fixing the input to the 
bad bit-channels to a priori known values, say zeros. Formally, 
each subset A of [N] of size \A\ = k specifies a polar code 
^^{A) of rate k/N. Cn{A) is actually a linear code with 
length N and dimension k. The generator matrix of Cn{A) 
is a fc X iV matrix that consists of rows of Gat corresponding 
to the elements of A. 

Arikan also introduces the successive cancellation decoding 
for polar codes which leads to the following key theorem on 
the encoder-decoder pair of polar codes. This theorem is (the 
second part of) Proposition 2 of Arikan [1]. 

Theorem 2: Let Vt^ be a BSM channel and let A be an 
arbitrary subset of [N] of size |^| = fc. Suppose that a 
message U is chosen uniformly at random from {0, l}*^, 
encoded as a codeword of Cn{A), and transmitted over W. 
Then the probability that the channel output is not decoded to 
U under successive cancellation decoding satisfies 



Pt{U^U} ^ Y.Z{W^^) 
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Corollary 3: For any /? < Y2 and any BSM channel W, the 
polar code of length N associated with the set of good bit- 
channels gN{W,/3) defined in ^ approaches the capacity 
of W. Furthermore, the probability of frame error under 
successive cancellation decoding is less than 2^^ . 

III. Compound Polar Codes for multi-channels 

In this section, we start with explaining the model for multi- 
channels. Then we describe our compound polar construction 
for the case of 2-multi-channels. We extend compound polar 
code to the case of I parallel channels and prove the capacity- 
achieving property. Also, the successive cancellation decoding 
and its complexity for compound polar codes is discussed. 

A. Multi-channels 

We consider the following model for a multi-channel con- 
sisting of several binary-input discrete memoryless channels. 
Let Wi : ^ ^ ^, for i = 1, 2, . . . , /, denote the I given B- 
DMCs (indeed ^ = {0, 1}). Then the corresponding /-multi- 
channel (W1.W2 . . . W^/) : ^ ^1 X ^2 X • • • X ^ is another 
DMC whose transition probability for x{ = {xi,X2, ■ ■ ■ ,xi) e 
and = (yi, 2/2, ■ ■ • , 2/i) 6 ^1 X ^ X • • • X ^ is given 

by: 

I 

{W1.W2 . . .Wi){y{\x[) =YlW,iy,\x,) 

1=1 

In fact, each binary sequence of length I is transmitted through 
this multi-channel in such a way that each bit is transmitted 
over one of the I channels. In general, for any TV which is 
a multiple of /, a sequence of iV bits is transmitted over this 



multi-channel in such a way that each channel carries N /I bits. 
It is known to both transmitter and receiver that which channel 
carries which bits in the sequence. An arbitrary interleaver 
and deinterleaver can be employed in the transmitter and the 
receiver and thus, the ordering of channels does not matter 

In this section, we first explain a straightforward scheme 
which encodes and decodes separately over the underlying 
channels. Then we discuss our proposed scheme of compound 
polar codes and prove the channel polarization theorem for this 
scheme. 

B. A straightforward construction 

One straightforward solution for constructing polar code 
over a multi-channel is to encode the information separately 
over the underlying binary-input channels using polar encoders 
corresponding to each channel. Suppose that a set of I channels 
Wi , W2 , ■ . ■ ,Wi is given. We want to construct a scheme of 
length N and rate R for transmission over this set of channels 
such that all the channels are used equally N/l times. Also, 
we require that R approaches the average capacity of all the 
channels as N goes to infinity. For i = 1, 2, . . . , Z, we construct 
a polar code of length N/l with rate Ri to be transmitted over 
Wi. Let 

R= ^{Ri + R2 + ■■■ + Ri) 

Then the scheme of length N and rate R is as follows. Given 
the input sequence of NR bits, split it into I chunks of size 
NRi/l, for i = 1,2, ... ,1. Then encode the i-th chunk using 
the i-th polar encoder and transmit the encoded sequence over 
Wi. There are also I separated decoders at the receiver to 
decode the output of each of the channels separately to get 
the i-th transmitted chunk. We call this scheme the separated 
scheme. Assuming that all the underlying channels are BSM 
channels, the rate Ri of the constructed polar code over 
Wi approaches the capacity of Wi for all i's. Therefore R 
approaches the average of the capacities of Wi's. 

C. Compound polar transformation over 2-multi- channels 

In this section, we propose our unified scheme for construc- 
tion of polar codes over 2-multi-channels. From a practical 
point of view, it is desirable to have only one encoder and one 
decoder to reduce the hardware complexity. Also, by combin- 
ing all the channels together and sending one single codeword, 
efficiently designed for this multi-channel, we achieve a better 
trade-off between the rate and probability of error in the whole 
scheme. 

Suppose that a multi-channel with two constituent B-DMC 
channels Wi and W2 is given. The building block of our 
compound polar transformation is shown in Figure[T] 

We actually combine the two channels at the first step. 
Then the recursion is applied to this combined channel exactly 
same as polar codes. More precisely, let T-L denote the channel 
with input ui and U2 and output yi and 2/2 as shown in 
FigurefT] Then there exist a permutation vr, such that applying 
to 2"-i independent copies of Ti. is equivalent to the 
transformation shown in Figure[2] The permutation block tt is 
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Fig. 1 . The proposed building block of our scheme 



designed in such a way that the first half of encoded block is 
transmitted through Wi and the second half through W2. The 
whole transformation from to Y^^ is called the compound 
polar transformation. 



Ul 



U2 



UN 



! 

XN 



TT 



2-multi 
channel 



^-1 



m 

VN 



Fig. 2. The proposed scheme for 2-multi-channels with length TV 



D. Construction of capacity achieving compound polar codes 

The building block shown in Figure[T]can split into two bit- 
channels by generalizing the definition of channel combining 
suggested in [1]. Suppose that ui and U2 are samples of 
two independent uniform binary random variables Ui and 
U2, respectively. Notice that by applying the chain rule to 



I{Ul,Y^) we get 

liUlY^) 



I{Ur,Yl) + I{U2:Y^\Ui) 
I{Uv,Y,^) + I{U2;Yl,Ui) 



where the last equality follows since Ui and U2 are assumed 
to be independent. The term I{Ui,Yi) can be interpreted as 
the mutual information of the channel between Ui and the 
output Y^, where U2 is considered as noise. Let us denote 
this channel by Wi [*] Formally, for any two B-DMCs 
Wi : ^ ^1 and W2 : ^2 (indeed ^ = {0, 1}), 

let Wi H W2 : ^ ^ ^1 X ^ denote another B-DMC whose 
transition probability for any {y 1,1/2) e x and ue ^ 
is given by 

Wi\BW2{yi,y2\u) = IY, Wi{yi\u®x)W2{y2\u) (9) 

xe.9: 

Similarly, the term I{U2',Yi ,Ui) can be interpreted as the 
mutual information of the channel between U2 and Y^ when 
Ul is available at the decoder Formally, for any two B-DMCs 

Wi : ^ ^1 and W2 : J:" ^ ^2, let Wi®W2 : ^ x 



'3/^2 y- X denote another B-DMC whose transition probability 
for any 1/2) 6 x and a;, u e ^ is given by 

Wx®W2{yi,V2,x\u) = ^Wi{yi\u®x)W2{y2\u) (10) 
The channels Wi ® W2 and Wi ® W2 are depicted in Figure|3] 

Ui 





Wi H W2 Wi® W2 

Fig. 3. The bit-channels of the proposed building block 

The individual bit-channels can be defined for this com- 
pound polar transformation with some modification to the 
definition of bit-channels for original polar transformation in 
Let W denote the channel from to in Figureji] 
Then the individual bit-channels (Wi.W2)^^^ are defined as 



follows. For i = 1,2, 



{W,.W2)^j!,\y^,u\-'\u,) 
1 



dof 



2JV-1 



(11) 



2 w^(yf|K"\u„<i); 

We also define the good bit-channels and bad bit-channels 
same as before i.e. for any /3 < Vj and N = 2^ 

gN{Wi,w2,p) 



BNiWi,W2,l3) 



dcf 



We show that the compound polar transformation shown in 
Figure|2] is equivalent to two separated polar transformations 
of length N/2 for Wi H W2 and Wi ® W2 independently. 
Therefore, the channel polarization theorem can be established 
for this proposed transformation accordingly. This is proved 
next. The following lemma is needed to establish the proof of 
Theorem|5] 

Lemma 4: For any two BSM channels Wi and W2, 

C{Wi H W2) + C{Wi ® W2) = C{Wi) + C{W2) 
Proof: Let Ui and U2 be two independent uniform binary 
random variables. Let Yi and I2 be the outputs of the channels 
Wi and W2 with inputs Ui + U2 and U2 respectively, as 
depicted in Figure[T| Then 

C{Wi) + C{W2) = /(C/i + U2;Y{) + I{U2;Y2) = I{Ul,Y^) 
= I{Ux;Y^) + I{U2;Y^\Ui) 
= I{U^-Y^)+I{U2;Y^,Ui) 
= C{Wi S W2) + C{Wi ® W2) 



where we used the fact that Wi, W2, Wi H W2 and Wi ® 
W2 are all symmetric and therefore, the symmetric capacity 
is equal to the capacity for each of them. ■ 
Theorem 5: For any two BSM channels Wi and W2 and 
any constant /3 < V2 we have 



lim 



\gNiWi,w2,i3)\ 

N 



1 



(C(T4^i)+C(W^2)) 



Proof: For simplicity let W denote Wi H W2 and W 
denote Wi ® W2 ■ By induction on n = log N, we can show 
that for 1 i N/2 



{Wi.W2)^^^ 

and for N/2 < i N 



N/2 



(12) 



(13) 



The base of induction is clear by definition. The induction step 
follows by the recursive structure of the polar transformation. 
If N is large enough, then we can pick /3' such that /3 < 

/?' < 1/2 and 



2-(JV/2)'= 

N/2 



< 



N 



Then definitions of good bit-channels given in (|6]l and (12i 
together with ([12]) and ( fTSj l imply that 



\gN{W,,W2,f3)\^ gN/2{W,l3') + gN/2{W,l3') 
Then by Theorem[T| and Lemma|4] 

\gNiWi,w2,i3)\ 



(14) 



lim 



N 

Gn/2{W,P') 
N 



gN/2{w,i3') 

N 



5= lim h lim 

N^oo N N^oo 

= l{CiW) + C{W)) = l{CiW^)+CiW2)) 



On the other hand we have 



N 



N 



{l{Wi) + I{W2)) = 2 l{iWi.W2) 



N ) 



(15) 



In ) 



> \Qn{W^.W2,P)\ 



.TV" 



(16) 
(17) 



(15 1 is by the chain rule on the mutual information between 



the input and output of the scheme shown in Figure|2] ([16]) 
follows by (]2ji. ( [TT] ) holds by definition of the set of good 
bit-channels SAr(VFi, VF2, Therefore, 

which completes the proof of theorem. ■ 



The encoding of our scheme is similar to that of polar codes. 
Let k = \Qn {Wi , W2 , /3) I • Then the polar code associated with 
the set of good bit-channels Qn{Wi, W2, (3) is a {k, N) code. 
The positions corresponding to the indices in Gn{Wi, W2, f3) 
carry the information bits and the rest of input bits are frozen 
to zeros. Then the following theorem and the corollary follows 
similar to Theorem|2] and Corollary [3] proved in [1]. 

Theorem 6: Suppose that a message U is chosen uniformly 
at random from {0,1}*^, encoded using polar code associated 
with a set ^ c [TV] and transmitted over Wi and W2 as 
described in Figure|2] Then the probability that the received 
word Y is not decoded to U under successive cancellation 
decoding satisfies 



N ) 



ieA 



Corollary 7: For any /3 < 1/2 ™d any two BSM channels Wi 
and W2, the polar code of length N associated with the set of 



good bit-channels GNiWi, W2, l3) defined in (12 1 approaches 
the average of the capacities of Wi and W2- Furthermore, 
the probability of frame error under successive cancellation 
decoding is less than 2" 



E. Extending the construction to l-multi- channels 

In this section, we generalize the compound structure pro- 
posed in the foregoing subsection to the case of ^-multi- 
channels. Suppose that a multi-channel consisting of set of 
I B-DMCs Wi,W2, ■ ■ ■ ,Wi\5 given. We fix an Z x ^ invertible 
matrix Gq as the initial matrix. Then the building block 
corresponding to Go is shown in Figure]?] In this figure, 
x\ = u{.Gq and then xi,X2, ■ ■ ■ ,xi are transmitted through 
Wi, W2, . . . ,Wi, respectively. 
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yi 



Fig. 4. The building block for the general case of (-multi-channels 

In [4], a general transformation G®" is considered, where 
G is an arbitrary I x I matrix with ^ > 3. A necessary 
and sufficient condition on G is provided which guarantees 
polarization for any BSM channel. It is proved in [4] that if G 
is an invertible matrix, then polarization happens if and only if 
G is not upper triangular As a result, we can pick the matrix 
Go to satisfy this condition. Then the results of the forgoing 
subsection can be easily generalized to compound schemes of 
length Z" transmitted over an /-multi-channel with transform 
matrix G®" . However, the problem is that the successive 
cancellation decoder for this scheme is not easy to implement 
and decoding complexity grows by a factor of 2'. 



In the case that I = 2™ is a power of two, we pick the initial 
matrix for the building block to be Go = G®™, where G is the 
base 2x2 polarization matrix. Then the polarization matrix 
G®" is applied to this building block resulting in a compound 
polar code of length = 2"+™. The advantage of picking 
this particular Go is the low complexity decoding algorithm. 
In fact, the successive cancellation decoder with complexity 
0{N log A^) that is used for the Arikan's polar code of length 
N can be applied to this compound code as well. 

F. l-compound polar codes with low complexity decoder 

In this section, for an arbitrary number of constituent chan- 
nels I, we propose a scheme which enjoys the low complex 
0{N\ogN) decoder 

For n 5= 0, we construct the general scheme with length 
N = L2" as follows. We apply the polar transformation G®" 
to the proposed building block in Figure]?] The block diagram 
of the proposed transformation is shown in Figure]5] In fact, 
the input sequence Ui is multiplied by Go®G®'^. We design 
the permutation tt in such a way that the first N /I encoded 
bits are transmitted through N /I independent copies of Wx, 
the second N /I encoded bits are transmitted through N /I 
independent copies of W2 etc. This is the general compound 
polar transformation. 



Ui 
U2^ 

UN 



Go ® G®" 



Xi 



X2 



Xn 



/-multi 
channel 



yi 

2/2 

2/JV 



Fig. 5. The compound polar transformation for the general case of Z-multi- 
channels 



Suppose that Wi,W2, ■ . ■ ,Wi are BSM channels. Then it 
can be shown that the compound polar transformation in 
Figure]?] is equivalent to I separated polar transformations 
of length 2" for I certain bit-channels corresponding to the 
building block, independently. Then the results of Theorem]6] 
and Corollary]?] can be generalized to this compound scheme 
accordingly. 

The original successive cancellation (SC) decoder of polar 
codes invented by Arikan in [1] can be extended to the 
compound polar codes with some small modifications in a 
straightforward way. Let N = 12" and suppose that is the 
vector that is multiplied by Go ® G®" and then transmitted 
over independent copies of Wi , W2 , . . . ,Wi as shown in 
Figurep] Let denote the received word. For i = 1, 2, . . . , A^ 
if is not a good bit-channel, then the decoder knows 

0. 



that i-th bit Ui is set to zero and therefore, Ui 



Otherwise, the decoder computes the likelihood L^j of Ui, 
given the channel outputs yi and previously decoded u^{~^. 



Then it makes the hard decision based on L 



N 



The likelihood functions L^-* can be computed recursively 
similar to what Arikan proposed [1]. The only question is 
how to initiate the SC decoder for n = 0, when N = I. A 
naive way of computing transition probabilities of constituent 
bit-channels results in the complexity 0(2'). The recursive 
steps can be done using Arikan's refined SC decoding al- 
gorithm with complexity 0{N\ogN/l). Therefore, the total 
complexity is 0(Af(logiV— log/ + 2')). As N grows large, the 
dominating term is N log N and therefore the total complexity 
of SC decoding algorithm is 0{N log N). 

IV. Simulation results 

Transmission over AWGN channel with 16-QAM BICM 
is considered. In 16-QAM BICM, there are actually two 
constituent binary-input channels. Among the 4 bits in each 
symbol, two of them go through one channel denoted by Wi 
and the other two goes through the other channel denoted by 
W2- In the constellation that we are using, the mapping is such 
that bits 1 and 2 in each symbol goes through Wi and bits 3 
and 4 goes through W2- The conventional way of modulating 
a codeword of length N is to split into N/A sub-blocks of 4 
consecutive bits each and map them into N/A symbols. 

For the code construction, we take a numerical approach to 
estimate to probability of error of the individual bit-channels. 
For simulation, the block length is fixed to 2^" = 1024 and the 
rate to 1/2. Transmission over AWGN channel is considered. 
In the separated scheme, we split the bits into two groups 
based on the channel that they observe. Then we construct two 
polar codes for Wi and W2 separately. This should be done 
in such a way that the total rate is 1/2. Since the two channels 
are different, we have to figure out how to assign the rate to 
be transmitted on each channel. We fix the Ei,/No = 5dB and 
then numerically estimate the bit-channel probability of error 
for each of the channels. We pick the rates such that the total 
probability of error is minimized. It turns out that the rate 0.62 
on the stronger channel Wi and 0.38 on the weaker channel 
W2 minimizes the total probability of error at 5 dB. We use 
the same scheme for all SNRs. 

For the compound scheme, we use an interleaver to guar- 
antee the right ordering of the transmitted bits as depicted in 
Figure|2] In fact, the interleaver switches the second and third 
bit in each symbol with each other. There is a deinterleaver at 
the decoder which does the same to the channel outputs. The 
comparison between the two methods is shown in Figure|6] 
As we can observe, our compound scheme is about 1.5dB 
better than the separated scheme at moderate SNR's. As 
SNR increases, the gaps become larger and the curves start 
diverging. 

V. Conclusions and Future Work 

In this work, we proposed a compound polar-based scheme 
to be used over multi-channels. We extended the channel 
polarization theorem to this case and proposed the compound 
polar code construction. We also provided simulation results 
at finite block lengths for BICM channels. There are a couple 
of open problems regarding the proposed construction. One 
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Fig. 6. Performance of the proposed scheme over 16-QAM 



is regarding the code construction over BICM channels. We 
took a numerical simulation-based approach to estimate the 
bit-channel probability of errors. An alternative way is to 
extend the Tal-Vardy method [5] for efficiently constructing 
polar codes to the case of BICM channels. This is left as a 
future work. Another question is what is the best ordering of 
the channels in the general building block in Figure|4] It turns 
out that for the case of 2-multi-channels the ordering does not 
matter However, for general I different orderings may result in 
different polarization rates. The question is how to characterize 
the polarization rate in terms of the building block and how 
to pick the best ordering. The second open problem is about 
choosing the initial I x I matrix Gq- Which Gq results in the 
best performance at finite block lengths? One strategy is to 
pick Go that maximizes the polarization rate among alH x Z 
channels as characterized in [4]. However, since Go is only 
used in one level of polarization in the compound polar code, 
this is not necessarily the best choice. The answer to these 
questions will help to design more efficient schemes at finite 
block lengths. 
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